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Abstract —Generalized spatial modulation (GSM) uses nt 
transmit antenna elements but fewer transmit radio frequency 
(RF) chains, Urf. Spatial modulation (SM) and spatial multi¬ 
plexing are special cases of GSM with Ti'p^ — 1 and — rxt, 
respectively. In GSM, in addition to conveying information bits 
through Tirf conventional modulation symbols (for example, 
QAM), the indices of the active transmit antennas also 
convey information bits. In this paper, we investigate GSM for 
large-scale multiuser MIMO communications on the uplink. Our 
contributions in this paper include: (i) an average bit error 
probability (ABEP) analysis for maximum-likelihood detection in 
multiuser GSM-MIMO on the uplink, where we derive an upper 
bound on the ABEP, and (ii) low-complexity algorithms for GSM- 
MIMO signal detection and channel estimation at the base station 
receiver based on message passing. The analytical upper bounds 
on the ABEP are found to be tight at moderate to high signal-to- 
noise ratios (SNR). The proposed receiver algorithms are found to 
scale very well in complexity while achieving near-optimal perfor¬ 
mance in large dimensions. Simulation results show that, for the 
same spectral efficiency, multiuser GSM-MIMO can outperform 
multiuser SM-MIMO as well as conventional multiuser MIMO, 
by about 2 to 9 dB at a bit error rate of 10“®. Such SNR gains in 
GSM-MIMO compared to SM-MIMO and conventional MIMO 
can be attributed to the fact that, because of a larger number 
of spatial index bits, GSM-MIMO can use a lower-order QAM 
alphabet which is more power efficient. 

Keywords — Large-scale MIMO systems, generalized spatial mod¬ 
ulation, GSM-MIMO receiver, channel hardening, message passing. 

I. Introduction 

Large-scale MIMO systems with tens to hundreds of anten¬ 
nas are getting increased research attention IIl-El. Because 
of its advantages of very high spectral efficiencies/sum rates, 
increased reliability, and power efficiency, large-scale MIMO 
technology is being considered as a potential technology for 
next generation (example, 5G) wireless systems 0. The fol¬ 
lowing two characteristics are typical in conventional MIMO 
systems: (*) there will be one transmit radio frequency (RF) 
chain for each transmit antenna (i.e., if n* is the number of 
transmit antennas, then the number of transmit RF chains, 
Urf, will also be rit), and {ii) information bits are carried 
only on the modulation symbols (example, QAM). Conven¬ 
tional multiuser MIMO systems with a large number (tens to 
hundreds) of antennas at the base station (BS) are referred 
to as ‘massive MIMO’ systems in the recent literature 0,0. 
Key technological issues that need to be addressed in practical 
realization of large-scale MIMO systems include design and 
placement of compact antennas, multiple RF chains, and large- 
dimension transmit/receive signal processing techniques and 
algorithms 0-0. 


Spatial modulation (SM), an attractive modulation scheme 
for multi-antenna communications 0,0, can alleviate the 
requirement of multiple transmit RF chains in MIMO systems. 
In SM, the transmitter has multiple transmit antennas but 
only one transmit RF chain. In a given channel use, only 
one of the rit transmit antennas will be activated, and the 
remaining n* — 1 antennas remain silent. On the active transmit 
antenna, a symbol from a conventional modulation alphabet 
A is transmitted. In addition to information bits conveyed 
through the modulation symbol from A, the index of the active 
transmit antenna also conveys information bits. Therefore, 
the number of bits conveyed in one channel use in SM is 
[log 2 ntJ + [log 2 |A|J. Space shift keying (SSK) is a special 
case of SM. Like in SM, in SSK also only one antenna 
among rit antennas is activated in a given channel use. On 
the activated antenna, instead of sending a symbol from a 
conventional alphabet as is done in SM, a constant amplitude 
signal (say, H-1) is transmitted in SSK. Therefore, the number 
of information bits conveyed in one channel use in SSK is 
[log 2 nt\ ■ For example, for n* = 4, the two-bit combinations 
{00,01,10,11} are mapped to antenna indices {1,2,3,4}; 
antenna 1 is activated when input bits are 00, antenna 2 is 
activated when input bits are 01, antenna 3 is activated when 
input bits are 10, and antenna 4 is activated when input bits 
are 11. The problem of SSK signal detection at the receiver 
in a given channel use then becomes one of finding which 
one among the n* antennas was activated, i.e., determining 
the index of the active antenna. Assuming that the choice of 
an active antenna among all antennas is equally likely and a 
‘h-1’ was sent on the active antenna, and that the channel gains 
from Tit transmit antennas to rir receive antennas are known 
at the receiver, the maximum likelihood (ML) decision rule to 
find the active antenna index is given by 

J = arg iniri ||y-hjf, (1) 

j, l<J<nt 

where y is the x 1 received signal vector and hj is the n^xl 
channel gain vector from transmit antenna j to the receive 
antennas. The estimated antenna index j is then demapped 
to the information bits which represent that index. In SM 
signal detection, in addition to detecting the active antenna 
index, information bits conveyed through the conventional 
modulation symbol carried on the active antenna also have 
to be detected. 

A lot of recent research has focused on SM and SSK in 
point-to-point as well as cooperative relaying settings (see 
0,0 and the references therein). Bit error performance of 
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SSK and SM in single-user point-to-point communication 
has been analyzed in |[8l,||9|- Transmit diversity schemes for 
SM MIMO (i.e., systems that combine SM and space-time 
coding) have been analyzed in JTOl . Multiuser SM MIMO 
on the downlink has been analyzed in m. SSK and SM 
employed on the uplink in multiuser MIMO systems have 
been studied in im-flB- In I!l-I!6l, it has been shown 
that, for the same spectral efficiency, multiuser SM-MIMO can 
outperform conventional multiuser MIMO. In this paper, we 
are interested in a modulation scheme which is a generalization 
of SM, referred to as generalized spatial modulation (GSM) 
ini,ii8i. GSM was introduced in ini and its achievable rate 
was studied in detail in m. Here, we are interested in the 
performance analysis and signal detection of multiuser GSM 
on the uplink in large-scale MIMO systems. Such a study has 
not been reported in the literature before. 

In GSM, the number of transmit RF chains, n^/, is param¬ 
eterized such that 1 < Urf < n*, and, in a given channel 
use, Urf out of Tit transmit antennas are chosen and activated 
The remaining n* — n^/ antennas remain silent. 
On the chosen antennas, n^f modulation symbols (one on each 
chosen antenna) are transmitted. The indices of the n^/ active 
antennas out of nt available antennas convey [log 2 
information bits. This is in addition to the nr/[log 2 |A|J 
information bits conveyed by the n^/ modulation symbols. 
It can be seen that both SM and spatial multiplexing turn out 
to be special cases of GSM with Urf = 1 and rirf = rit, 
respectively. In ifTSll . it has been shown that for a given 
modulation alphabet and rit, there exists an optimum rirf in 
GSM that maximizes the spectral efficiency. 

In this paper, we consider the uplink in multiuser MIMO 
systems, where the BS has a large number of receive antennas 
(tens to hundreds) and each user terminal employs GSM with 
rit transmit antennas and Urf transmit RF chains. Some GSM 
configurations of interest at the user terminal include; (rit = 4, 
Urf = 2), (rit = 8, Urf = 2). When n* > 1 and rirf = 1, 
GSM specializes to SM; example, (rit = 2, rirf = 1), (nt = 4, 
rirf = !)■ When n* = 1 and = 1, GSM specializes to 
conventional modulation. Our contributions in this paper can 
be summarized as follows. 

• We first analyze the average bit error probability (ABEP) 
of multiuser GSM-MIMO under maximum-likelihood 
(ML) detection. We derive an upper bound on the ABEP, 
which is tight at moderate to high signal-to-noise ratios 
(SNR). 

> We then propose low-complexity algorithms for GSM- 
MIMO signal detection and channel estimation at the BS 
receiver based on message passing. The proposed receiver 
algorithms scale very well in complexity and achieve 
near-ML performance in large dimensions. Simulation 
results show that, for the same spectral efficiency, mul¬ 
tiuser GSM-MIMO can outperform multiuser SM-MIMO 
as well as conventional multiuser MIMO, by about 2 to 9 
dB at a bit error rate (BER) of 10“^. Such SNR gains in 
GSM-MIMO compared to SM-MIMO and conventional 
MIMO can be attributed to the fact that, because of a 
larger number of spatial index bits, GSM-MIMO can 



Fig. 1. GSM transmitter. 

use a lower-order QAM alphabet which is more power 
efficient. 

« We carry out a study of the proposed and existing 
algorithms, which includes an assessment of their perfor¬ 
mance and computational complexity. Simulation results 
show that the proposed detection algorithms have lesser 
complexity than minimum mean square error (MMSE) 
detection complexity, while achieving significantly better 
performance than MMSE detection performance. 

The rest of the paper is organized as follows. The multiuser 
GSM-MIMO system model on the uplink is presented in 
Section In] In Section [Till we derive an analytical upper bound 
on the ML detection performance in multiuser GSM-MIMO. 
In Section |IV] we present the proposed detection and channel 
estimation algorithms for multiuser GSM-MIMO and their 
performance. In Section |V] the performance of the proposed 
algorithms in frequency-selective fading are presented. Con¬ 
clusions are presented in Section |Vll 

II. Multiuser GSM-MIMO system model 

Consider a multiuser system with K uplink users commu¬ 
nicating with a BS having N receive antennas, where N is in 
the order of tens to hundreds. The ratio K/N is the system 
loading factor. Users employ GSM for their transmission. Each 
user has rit transmit antennas and rirf, 1 < n-rf < rit, transmit 
RE chains. An rirf x rit switch connects the RE chains to the 
transmit antennas. In a given channel use, each user selects 
rirf of its Tit transmit antennas, and transmits symbols 
from a modulation alphabet A on the selected antennas. The 
remaining n* — rirf antennas remain silent (i.e., they can be 
viewed as transmitting the value zero). The GSM transmitter at 
the user terminal is shown in Eig.[T] The selection of rirf active 
antennas is made based on [log 2 )J information bits. Eor 
example, for n* = 4 and = 2, two out of the four antennas 
are selected using [log 2 ( 2 )] = Uog 2 6J = [2.585J = 2 
information bits. The mapping of information bits to active 
antenna indices in GSM is described below. 

Define an ‘antenna activation pattern’ to be an n* x 1 vector 
consisting of I’s and O’s, where a 1 in a coordinate indicates 
that the antenna corresponding to that coordinate is active and 
a 0 indicates that the corresponding antenna is silent. Note 
that there are (^‘) activation patterns possible. Eor example, 
for rit = 4 and rirf = 2, the following six activation patterns 
are possible: 

[1 1 0 0 ]^, [0 0 1 1 ]^, [1 0 1 0 ]^, [0 1 0 1 ]^, [10 0 1 ]^, [0 1 1 0 ]^ 
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Out of the possible activation patterns, only 
activation patterns are needed for signaling. Let S denote the 
set of these 2^*°®^ activation patterns chosen from the set 

of all possible patterns, i.e., |5| = 2^^°®^^"'-/^^. In the above 
example, let the set of chosen activation patterns be 

5 = {[1 1 0 0 ]"^, [1 0 1 0 ]"^, [10 0 1 ]"^, [0 1 1 0 ]^}. 

A mapping is done between combinations of [log 2 {n*f)\ 
information bits to activation patterns in S. The following table 
shows such a mapping for the rit = 4, n^/ = 2 example: 


Information bits 

Antenna activation pattern 

Remarks 

00 

[110 0] 'J 

antennas 1,2: Active; 
antennas 3,4: Silent 

01 

[10 1 0]'^ 

antennas 1,3: Active; 
antennas 2,4: Silent 

10 

[10 0 1]'J' 

antennas 1,4: Active; 
antennas 2,3: Silent 

11 

[oil 0]'^ 

antennas 2,3: Active; 
antennas 1,4: Silent 


TABLE I 

Mapping between information bits and active antenna indices 
IN GSM FOR nt = 4, n^f = 2. 


Note that the mapping does not need channel state in¬ 
formation. Also, the mapping rule between information bits 
and active antenna indices is made known to the transmitter 
and receiver a priori for encoding and decoding purposes, 
respectively. 

Apart from the bits conveyed through active antenna indices, 
additional bits are conveyed through modulation symbols sent 
through the Urf active antennas. Therefore, the total number 
of bits conveyed by a GSM transmitter per channel use is 
given by 


log2 


nt 
Urf / J 


■ rirf Llog 2 |A|J bpcu. 


For example, a GSM transmitter with rit = 4, Urf = 2 and 
4-QAM conveys 6 bpcu. 

GSM signal set: Let denote the GSM signal set, which 
is the set of GSM signal vectors that can be transmitted. Then, 
ClfA is given by 


= {s : Sj e AU{0}, llsjlo = Urf, I(s) G S}, (2) 


where s is the n* x 1 transmit vector, sj is the jth entry of s, 
j = 1, - ■ ■ ,nt, ||s||o is the Zg-norm of the vector s, and I(s) is 
a function that gives the activation pattern for s; for example, 
I(s = [+1 - 1 - 1 0]'^) = [111 0]^. 

Example: Let nt = 4, Urf = 2, BPSK modulation, 
and 5 = {[1 1 0 0]"^, [1 0 1 0]^, [10 0 1]^, [0 1 1 0]^}. The 
GSM signal set for these parameters is given by 
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Figures |2(a)| and |2(b)| illustrate large-scale multiuser GSM- 
MIMO system and conventional multiuser MIMO (massive 


MIMO) system, respectively. Let G a denote the trans¬ 
mit vector from user k. Let x = [x^ x|’ • • • x^ • • • x^]^ 
denote the vector comprising of transmit vectors from all 
the users, where (.)^ denotes transpose operation. Note that 

xe(C:fA)^- 

Let H G denote the channel gain matrix, where 

denotes the complex channel gain from the jth 
transmit antenna of the fcth user to the ith BS receive antenna. 
The channel gains are assumed to be independent Gaussian 
with zero mean and variance cr^, such that = Knt. 

The (7^ models the imbalance in the received power from the 
Kth antenna, k G {I,-- - ,Knt}, due to path loss etc., and 
= 1 corresponds to the case of perfect power control. 
Assuming perfect synchronization, the received signal at the 
ith BS antenna is given by 

K 

Vi ~ ^ ^ h^JfcjXfc -f (3) 

where hj [fej is a 1 x ri; vector obtained from the ith row of H 
and (fc — l)nt + 1 to knt columns of H, and Ui is the noise 
modeled as a complex Gaussian random variable with zero 
mean and variance The received signal at the BS antennas 
can be written in vector form as 

y = Hx-bn, (4) 

where y = [yi, y 2 , • • • , VnV and n = [m, n 2 , • • • , n^]'^ . 
For this system model, the ML detection rule is given by 

x= argmin jjy - Hxf, (5) 

where jjy — Hxjp is the ML cost. The maximum a posteriori 
probability (MAP) decision rule is given by 

x= argmax Pr(x|y,H). (6) 

Since = (|5||A|"'’t')^, the exact computation of 

(O and ((6]l requires exponential complexity in K. In the next 
section, we derive an analytical upper bound on the ABEP 
for ML detection. In Section |IV] we propose message passing 
based detection algorithms which give approximate solutions 
to ((6]l at low complexities. 

We note that the condition for the spectral efficiencies 
of GSM-MIMO (with nt transmit antennas, nrf transmit 
RF chains, and modulation alphabet A at each user) and 
conventional multiuser MIMO (with mt transmit antennas, nit 
transmit RF chains, and modulation alphabet B at each user) 
to be the same is given by 

i5||A|"'-^ = jir*. 

For example, (i) GSM-MIMO with BPSK, n* = 4, nrf = 2, 

(ii) SM-MIMO with 4-QAM, nt = 4, nrf = 1, and 

(iii) conventional multiuser MIMO with 4-QAM, 2 transmit 

antennas and 2 transmit RF chains, all have the same spectral 
efficiency of 4 bpcu per user. In conventional multiuser MIMO, 
the vector x G and the channel matrix H G 
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(a) Multiuser GSM-MIMO system. (b) Conventional multiuser MIMO system. 

Fig. 2. Large-scale multiuser GSM-MIMO and conventional multiuser MIMO system architectures. 


III. Average BEP Analysis 

In this section, we analyze the average BEP performance 
of ML detection in multiuser GSM-MIMO. Assume that all 
the transmit GSM signal vectors are equally likely. The ML 
detection rule in (|5]l can be written as 

Knt 

X = argmin ||y - ^ (7) 

where is the Kth element of x, is the Kth column of H 
and G = The pairwise error probability (PEP) of x 

being decoded as x G G can be written as 

Knt Knt 

P(x ^ x|H) = p(^\\y - > lly - 

K.=l «;=! 

N Knt N Knt 

= p(^Y\y^ (8) 

i = l «;=! i = l «;=! 

where hi ^ is the ith element of h„. Let Ai = Xf-hi ^ 

and Ai = Xi^hi^K.- Since x is the transmitted vector, 

yi = Ai + rii, i = 1, ■ ■ ■ ,N. Now, we can write 

N N 

P(x^x|H) = P(y]|yi-A,|^ 

i = l i=l 

N N 

i=l i=l 

N N 

= P(y]2»((ii-A0n*) (9) 

i=l i=\ 

where 5ft(.) denotes real part, (.)* denotes conjugation, and 
Lti — Ai)n*) is a Gaussian random variable with 

mean zero and variance 2a^ Lili 1^® ~ 4^^. Therefore, 


P(x^x|H) = Q 


N \ 

i-1 / 



||2 

= Q 

[\ 

/2a^ 

K = 1 


The argument in (fTol i has a central -distribution with 2N 
degrees of freedom. Computation of the unconditional PEP 
P(x —x) requires the expectation of the Q{.) function in 
(Uni) w.r.t. H, which can be obtained as follows ED: 

P(x ^ x) = ]Eh{P(x -> x|H)} 

= /(«)^E (11) 

i=0 V * / 

where /(a) = a = 4 ^ E and 9^ = 

\xk, — Xk]"^. Now, an upper bound on the average BEP based 
on union bounding can be obtained as 

xGG xeG\x 

where (i(x, x) is the number of bits in which x differs from 
X. The total number of PEPs to be calculated in (O is 
2’7(2’t— 1 ). Therefore, the complexity of the computation of the 
bound in (fTSl i will increase exponentially in K,nt, and n^/- 
In the following subsection, we devise simplification methods 
to reduce this computational complexity. 

A. Reduction of computation complexity in ( 1721) 

The expression for Pg in (fT 2 l i can be written in the 
following form: 

E Enx ^ x )!^. 

i=l x:I(x) = Sie5^ 

XT^x 


( 13 ) 
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For a given pair of antenna activation patterns and Sj, i,j € 
{1, • • • , |5|^}, the total number of PEPs are when 

i ^ j, and (|A|-^"''’f — 1) when i = j. 

Complexity reduction 1: For a pair of activation patterns 
Si and Sj, let Aij denote the set of active antennas that are 
common to both Si and Sj. Define Pij = Krirf — \ Aij\. Note 
that Pij G {0,1, • • • , min(nr./, nt — nrf)Ii}. Also, note that 
for any i,j for which /3ij = q, the value of the summation 
Y] Y P(x —x) in (foi l will be the same, and 

x:I(x)=Si x:I(x)=S3.Xi^x 

SO it is enough to compute this summation only once for each 
q. With this simplification, ( fOl l can be written as 


min(n^y ,nt —nj,f)K 

pb <^ E <^( 5 ) E 

<3'=0 x:I(x)=Si 


E^(^ 

x:I(x)=Sj 


d(x, x) 
X) , 


Pij=^ 

where (j){q) is the number of {si,Sj) pairs for which /3ij = q, 
which can be computed easily. 

Example: A direct computation of the first two summa¬ 
tions in (O is prohibitive, as the total number of terms 
is exponential in K,nt, Urf- For K = 2,nt = 4 and 
Urf = 2, |5^| = 2“^. So, the first two summations in (fTJI i 
will have 2® = 256 terms. Whereas for these parameters, 
q e {0,1,2,3,4}, </)({0,l,2,3,4}) = {16,88,128,22,2} 
and ^( 9 ) “ Hence, the inner summations can be 

computed only 5 times (once for each q), instead of 256 times. 

Complexity reduction 2: For each value of q, we need to 
compute |Ap^"'’t' PEPs. We propose to reduce this complex¬ 
ity as follows. The parameter a in (fTTli is the summation of 
Knt terms. Out of these Knt terms, Knt — {Krirf+ q) terms 
will be zero for a given value of q. Of the {Krirf +q) non-zero 
terms, 2q terms will take values from J = {|cp : c € A}, and 
Krirf— q terms will take values from L = {|c—cp : c, c € A}. 
Fet J = Am} and L = ^n}, where 

ji < 32 <■■■ < jm, h < h <■■■< In, m = |JJ|, and 
n= |L|. We write a as a = ai +a 2 , where ai is the sum of 
2q terms from J and a 2 is the sum of Krirf — q terms from L. 
Note that ai can take values in the range 2qji to 2qjm- For a 
given value of ai, the following equations must be satisfied; 


'^jiVi = ai, '^Vi = 2q, (14) 

2=^1 2—1 

where Vi is an integer such that Vi G {0,1,-- - ,[( 0:1 — 
SfcLz+i JkVk)/jiJ}- Similarly, a 2 can take values in the range 
{Krirf — q)h to {Krirf — q)ln, and, for a given value of a2, 
the following equations must be satisfied; 

n n 

'^kui =a 2 , y^Ui = Krirf - q, (15) 

2=1 2=1 

where Ui is an integer such that Ui G {0,1,--- , [(a 2 — 
Yl^i+i huk)/li\}- Since a = cti -f 0 : 2 , a lies in the range 
2qji + {Krirf — q)li to 2qjm + {Krirf — q)ln- A given value 
of a can be written as 

m n m n 

a = E jiVi + E E ^ E ^ Krirf - q- 

i —1 2=1 2 = 1 2=1 

(16) 

The choices of Vi’s and Ui’s to attain a particular a is 
not unique, i.e., there exist multiple pairs of x and x that 



Fig. 3. Comparison between analytical ABEP upper bound and simulated 
ABEP for ML detection in GSM-MIMO with N = 16, 32, X = 4, nt = 4, 
= 2, BPSK, and 4 bpcu per user. Analysis and simulation. 

correspond to different values of u^’s and iti’s but the same 
value of a. Thus, we need to evaluate (fTTli only once for a 
given a and count the number of possible combinations of 
Ui’s and Ui’s that correspond to that a. 

Example: When n* = 4, rirf = 3, A = {—1 — j, —1 -f j, 1 — 
j, 1 + jj, where j = then, J = {2}, L = {0,4,8}. 

For a particular value of q, say <7=1, the summation in (fl4li 
requires computation of PEP for 64 different pairs of GSM 
signals. But a lies in the range 2 to 18, and hence we need 
to compute only 17 PEP terms. 

B. Numerical results 

In Fig. [3 we compare the analytical ABEP upper bound 
and the simulated ABEP of multiuser GSM-MIMO with ME 
detection for the following system parameter settings; N = 
16, 32, AT = 4, n* = 4, n^/ = 2, BPSK, and 4 bpcu per user. 
It can be observed that the upper bound is tight at moderate to 
high SNRs. It is also observed that, as expected, both analysis 
and simulation predict that the ABEP performance improves 
as the number of BS antennas N is increased. 

In Fig. m we compare the ABEP performance of the 
following four different systems with N = 8 and K = 2; 
System 1 - conventional multiuser MIMO with rit = rirf = 1, 
16-QAM; System 2 - conventional multiuser MIMO with 
rii = rirf — 2, 8-QAM; System 3 - multiuser SM-MIMO 
with rit = 4, rirf = 1, 16-QAM; and System 4 - multiuser 
GSM-MIMO with rit = 4, rirf = 2, 4-QAM. Note that all the 
four systems achieve the same spectral efficiency of 6 bpcu per 
user. The first two systems are conventional multiuser MIMO 
systems where nt = Urf- System I uses one transmit antenna 
and one transmit RE chain at each user and achieves 6 bpcu 
per user by using 64-QAM. On the other hand. System 2 uses 
two transmit antennas and two transmit RE chains at each 
user and achieves 6 bpcu per user by using 8-QAM. System 
3 is a multiuser SM-MIMO system where each user uses four 
transmit antennas but only one transmit RE chain. Each user 
in this system uses 16-QAM to achieve 6 bpcu per user; 4 
bits through 16-QAM and 2 bits through indexing. System 4 
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Fig. 4. Comparison between the ABEP performance of four different systems, 
all with N = S, K = 2, and 6 bpcu per user: (i) conventional multiuser 
MIMO, nt = rirf = 1, 64-QAM; (ii) conventional multiuser MIMO, nt = 
n-rf = 2, 8-QAM; (Hi) multiuser SM-MIMO, nt = 4, n-rf = 1, 16-QAM; 
and (iv) multiuser GSM-MIMO, nt = 4, n^-f = 2, 4-QAM. Analysis and 
simulation. 

is a GSM-MIMO system where each user uses four transmit 
antennas and two transmit RF chains. This system uses 4- 
QAM on two streams to achieve 6 bpcu per user; four bits 
through modulation symbols (i.e., two 4-QAM symbols on 
two streams) and 2 bits through indexing. 

The following observations can be made from Fig. |4] First, 
it can be observed that multiuser SM-MIMO system (System 
3) achieves better performance compared to conventional 
multiuser MIMO systems (Systems 1 and 2) by about 2 to 7 
dB at 10“^ BER. The better performance of SM-MIMO over 
conventional MIMO in System 1 can be attributed to the fact 
that, to achieve the same spectral efficiency of 6 bpcu per user, 
SM-MIMO can use a smaller-sized QAM alphabet (16-QAM) 
than that used in system 1 (64-QAM), and that a smaller-sized 
QAM alphabet is more power efficient than a larger-sized one. 
Also, the better performance of SM-MIMO over conventional 
MIMO in system 2 is attributed to the fact that, although 
System 2 uses a smaller-sized alphabet (8-QAM) than that 
in SM-MIMO (16-QAM), there is interference from multiple 
spatial streams in System 2. Another observation in Fig. |4] 
is that multiuser GSM-MIMO system (System 4) performs 
better than multiuser SM-MIMO system (System 3) as well as 
conventional MIMO system in System 2 which also uses two 
RF chains like GSM-MIMO. This is because, though GSM- 
MIMO uses two spatial streams like conventional MIMO, its 
alphabet size is smaller than that in conventional MIMO. 

IV. Low-complexity Receiver Algorithms 

Optimal detection of multiuser GSM-MIMO signals in large 
dimensions is prohibitively complex. Generalized sphere de¬ 
coding (GSD) approach lfT9l . ll20l can be employed for GSM- 
MIMO detection. But the complexity of GSD can be high 
(exponential complexity at low-to-medium SNRs). Therefore, 
for large systems, GSD becomes prohibitively complex. In this 
section, we present low-complexity near-optimal detection and 
channel estimation algorithms for large-scale multiuser GSM- 
MIMO systems. Two approximate message passing based 


algorithms for detection are presented. In the first algorithm, 
abbreviated as MP-GSM (message passing for GSM) detection 
algorithm, messages are formed based on the basic multiuser 
GSM-MIMO system model in (|4]i. In the second algorithm, 
abbreviated as CHEMP-GSM (channel hardening-exploiting 
message passing for GSM) algorithm, messages are formed 
based on a matched filtered version of the basic system model 
in (0. We also present a channel estimation approach that 
directly obtains an estimate of H^H for use in the CHEMP- 
GSM algorithm. 

A. MP-GSM detection algorithm 

Consider the multiuser GSM-MIMO system model in 0. 
We model this system as a fully connected factor graph with 
K variable nodes (or factor nodes) corresponding to x^’s 
and N observation nodes corresponding to y^’s, as shown in 
Pig. |5(a)| We aim to get approximate MAP solution through 
message passing on this graph, where messages are formed 
by approximating the probability density of the interference 
as Gaussian. 

Messages: The messages passed between variable nodes and 
observation nodes in the factor graph are derived as follows. 
Equation 0 can be written as 


K 

2/i = hj_[fe]Xfe-f ^ h,_[j]Xj+ni, 

i=i j'/fc 


(17) 


= Qik 


where [jj is a row vector of length nt, given by 

+ t Hi (^j i)n t+2 • ■ ■ Hijnt], ^nd Xj € ^n/,A' 

term gik defined in dnii is approximated to be a Gaussian 
random variableQ with mean fiik and variance The mean 
Hik in the approximation is given by 

r ^ 

pik — ^ ^ ^ 

K 

K 

= E E 

where st’s are the non-zero entries in s and /’s are their 
corresponding indices, and the variance cr^c is given by 

cTik = Varf hi.blXj + rii j 

K n n I |2 2 

E Pn(s)hi,y]ss"hfy] - ^pji(s)hi,y]s -Tcr^ 


3 = 1 




(19) 

where (.)^ denotes conjugate transpose operation, and pki{s) 
is the posterior probability given by 

N I . |2 

Pkt' 


N _| _ _ Vi 1^ 

/ \ TT / 2/m /^mfc ^m,[fe]S \ 

k^{V 11 exp(^—^ 

m = 1 m k 


*This Gaussian approximation will be accurate for large K‘, example, in 
systems with tens of users. 
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(b) Observation node messages 
Fig. 5. The factor graph and messages passed in the MP-GSM algorithm. 


Message passing: The messages exchanged between obser¬ 
vation and variable nodes are illustrated in Figs. |5(b)| and |5(c^ 
The message from observation node yi to variable node Xfc 
consists of the two scalar variables pik and The message 
from variable node to observation node yi is a vector 
message given by pki = [pki{si),pki{s 2 ), ■ ■ ■ ,pki{s,^^rf :)]. 
The message passing steps are as follows. 

Step 1: Initialize Pki{s) to ^1 for all i, k and s. 

Step 2: Compute pik and from (fTSl l and (fTTt . respectively. 
Step 3: Compute pki from (l20l i. To improve the convergence 
rate, damping of the messages in (l20l i is done with a damping 
factor S G (0,1], as shown in the algorithm listing in Algo¬ 
rithm 1. The symbol probabilities at the end of an iteration 
are computed as 

w I u |2 

/ \ TT /“ J/i ~ Fife ~ \ 

Pk{s) oc|J_exp(^— ^ - —2 - -j,k = !,■■■ ,K. (21) 

i—1 ife' 


Stopping criterion: Repeat Steps 2 and 3 until 
||p(*) _ p(*+^)|| < e or until the number of iterations 

is less than a fixed maximum number, where = 




(i): 


nt,Al 






and e is a pre-fixed constant such that 0 < e -C 1. 

The detected vector of the fcth user at the BS is obtained as 


Xfc = argmax pfc(s). (22) 

The non-zero entries in Xfc and their indices are then demapped 
to obtain the information bits of the fcth user. The MP-GSM 
algorithm listing is given in Algorithm [T] 

Complexity and performance of MP-GSM algorithm: Here, 
we present and discuss the complexity and performance of the 
MP-GSM detection algorithm. 

Complexity: From (fTSl l. ( fT9] l. and ( l20l i. we see that the 
total complexity of the MP-GSM algorithm is 
We compare this complexity with that of the detection using 
the MMSE estimate given by (H^H -f The 

^Damping can be thought of as reweighting the messages with a reweighting 
parameter (damping factor) 5 mi-iiii. 


Input: y, H, 

Initialize: p™(s) t- l/|S"lfAl’ k, s 
for t = 1 —> number_of_iterations do 

for i = 1 — A do 
for j = 1 ^ K do 

Pij 

ggg"*-/ - iei(s) 


p% E 


E 




• .b'l 


1 Aij I 


end 


K 

E pij 

f=l 

K 

(Ti i 2_^ ^ij “ 1 “ ^ 

7 = 1 

for k = 1 ^ K do 


end 

end 

for k = 1 


^ik 

2 

^ik 


fli l-^ik 
^2 ~2 
~ ^ik 


foreach s G 


K do 

;gS„ 

ln(Pfc’(s)) 

N do 


do 


=1 


,[fcr 


end 

for i = 1 

foreach s G do 

Pkh^) •*—ln(Pfc*'(s))+ — 
_ 1-i 




ptli^) = ■^exp(p^'bs)) + Spti 

Cki is a normalizing constant 


end 


end 

end 
end 

Output: Pk{s) as per 1211 and Xfc as per ll221 . Vfc 
Algorithm 1: Listing of MP-GSM algorithm. 


complexity of this MMSE detection for the system model 
in (HI is given by 0{N^Knt). In Table [III we present a 
complexity comparison between the MP-GSM and MMSE 
detection algorithms. Erom Table |II] it can be seen that 
the MP-GSM detection complexity is less than the MMSE 
detection complexity. In addition to having this complexity 
advantage over MMSE detection, MP-GSM detection achieves 
significantly better performance than MMSE detection (we 
will see this in the performance results presented next). We 
further note that the computation of double summation in (fTSl l 
and ( fT9] l in the MP-GSM algorithm can further be simplified 
by using PET, as the double summation can be viewed as a 
convolution operation. 

Performance: We evaluated the BER performance of the 
MP-GSM detection algorithm in large-scale multiuser GSM- 
MIMO systems by simulations. Here, we assume perfect 
channel state information (CSI) at the receiver. We will relax 
this assumption later. Eigure |6] presents the performance of 
MP-GSM detection algorithm in a large-scale multiuser GSM- 
MIMO system with the following system parameters: K = 16, 
N = 64,128, rit = 4, Urf = 2, and 4-QAM. Note that 
the spectral efficiency in this system is 6 bpcu per user. 
We compare the performance of this system with two other 
systems which also have the same spectral efficiency of 6 bpcu 
per user. These systems are: 1) conventional multiuser MIMO 
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system with rit = Urf = 1, 64-QAM, and ML detection using 
sphere decoding (note that this is massive MIMO system; we 
abbreviate it as M-MIMO in the figures), and 2) multiuser 
SM-MIMO system with rit = 4, rirf = 1, 16-QAM, and 
MP-GSM detection. From Fig. |6] we observe that GSM- 
MIMO outperforms both SM-MIMO as well as conventional 
MIMO. For example, at a BER of 10“^, GSM-MIMO has a 
performance advantage of about 4 dB over SM-MIMO and 
about 7 dB over conventional MIMO. This observation is in 
conformance with similar performance advantage of GSM- 
MIMO over SM-MIMO and conventional MIMO predicted 
by analytical upper bounds in Section |III] 

Performance comparison with other detectors: Next, in 
Fig. |7] we compare the performance of MP-GSM detection 
with that of MMSE detection in multiuser GSM-MIMO with 
K = IQ, N = 64,128, nt = 4, rirf = 2, 4-QAM, 
and 6 bpcu per user. Erom Eig. |7] we observe that the 
performance of MP-GSM algorithm is better than MMSE 
detection performance by 9 dB for N = QA and 3 dB for 
N = 128 at a BER of 10“^. As noted in the discussion on 
complexity, MP-GSM achieves this better performance than 
MMSE at a lesser complexity than MMSE (as illustrated in 
TableUD. Iterative detection/decoding schemes that use MMSE 
filters and provide further refinements are common in the 
literature In addition to the comparison with MMSE 

detection performance, Eig. |7] also presents a comparison with 
the performance of the MMSE-SIC detector and the multi¬ 
branch MMSE decision feedback (MB-MMSE-DE) detector 
(with 4 branches and ordering) proposed in 1^ . It can be 
seen that while the MB-MMSE-DE and MMSE-SIC detectors 
perform better than the MMSE detector, the proposed MP- 
GSM detector outperforms the MMSE, MMSE-SIC, and MB- 
MMSE-DE detectors. Moreover, the complexity of the MP- 
GSM detector is less than those of the MMSE, MMSE-SIC, 
MB-MMSE-DE detectors. 


K 

Complexity in number of real operations xlO® 
(GSM-MIMO with N = 128, nt = 4, = 2, 4-QAM) 

MMSE 

MP-GSM 

(Sec. ITV-AI 

CHEMP-GSM 

(Sec. ITV-BI 

16 

3.594 

2.195 

3.142 

32 

19.767 

4.391 

6.281 

64 

28.355 

8.782 

12.265 

96 

36.941 

13.173 

18.013 

128 

45.526 

17.564 

21.637 


TABLE II 

Comparison between the complexities (in number of real 
OPERATIONS) OF MMSE DETECTION, MP-GSM DETECTION, AND 
CHEMP-GSM DETECTION, IN MULTIUSER GSM MIMO WITH N = 128, 
nt = 4, n^f = 2 AND 4-QAM. 

Performance for same spectral efficiency and QAM size: We 
note that if both spectral efficiency and QAM size are to be 
kept same in GSM-MIMO and M-MIMO, then the number 
of spatial streams per user in M-MIMO has to increase. Eor 
example, GSM-MIMO can achieve 4 bpcu per user with 4- 
QAM using n* = 4 and Urf = 1. M-MIMO can achieve 
the same spectral efficiency of 4 bpcu per user using one 
spatial stream (i.e., rit = 1, n-rf = 1) with 16-QAM. But 
to achieve the same spectral efficiency using 4-QAM in M- 
MIMO, we have to use n* = 2, rirf = 2, i.e., two spatial 


M-MIMO (ML), 64-QAM 
N=64, K=16 
M-MIMO, 64-QAM 
N=128, K=16 

SM-MIMO, 16-QAM 
N=64, K=16, n=4, n^=1 

t rf 

SM-MIMO, 16-QAM 
N=128, K=16, n=4, n =1 
t rf 

GSM-MIMO, 4-QAM 
N=64, K=16, n=4, n =2 
t rf 

GSM-MIMO, 4-QAM 
N=128, K=16, n=4, n =2 
t rf 

' : 6 bpcu per user 


\ 

• ■ ■ v • 

\ 


20 25 

Average SNR in dB 

Fig. 6. BER performance of three different multiuser systems with the same 
spectral efficiency of 6 bpcu per user, K = 16, N = 64,128: i) M-MIMO 
with nt = n^f = 1, 64-QAM, sphere decoding; ii) SM-MIMO with nt = 4, 
n^f = 1, 16-QAM, MP-GSM detection; iii) GSM-MIMO with nt = 4, 
riff = 2, 4-QAM, MP-GSM detection. 



Fig. 7. BER performance comparison between i) MP-GSM detector, ii) 
MMSE detector, iii) MMSE-SIC detector, and iv) MB-MMSE-DF detector 
in 1361 , in multiuser GSM-MIMO with K = 16, N = 64,128, nt = 4, 
riff = 2, 4-QAM, and 6 bpcu per user. 

streams per user with 4-QAM on each stream are needed. This 
increase in number of spatial streams per user increases the 
spatial interference. The effect of increase in number of spatial 
streams per user in M-MIMO for the same spectral efficiency 
on the performance is illustrated in Fig. [8] for a: = 16 and 
N = 128. 

In Eig.0 we compare the performance of the following four 
systems with the same spectral efficiency of 4 bpcu per user, 
K = 16, and N = 128: i) GSM-MIMO with (n* = 4, rirf = 
2, BPSK), ii) M-MIMO with (n* = 1, Urf = 1, 16-QAM), iii) 
M-MIMO with (rit = 2, rirf = 2, 4-QAM), and iv) M-MIMO 
with (nt = 4, Urf = 4, BPSK). Detection in the GSM-MIMO 
system is done using the MP-GSM algorithm. Detection in the 
M-MIMO systems is done using the likelihood ascent search 
(LAS) algorithm in initialized with MMSE solution. It 
can be seen that among the four systems considered in Eig. 
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Fig. 8. BER performance of GSM-MIMO with (nt = 4, n^f = 2, BPSK), 
M-MIMO with (nt = 1, = 1, 16-QAM), M-MIMO with (m = 2, 

Hrf = 2, 4-QAM), and M-MIMO with (m = 4, n^f = 4, BPSK) for 
K = 16, N = 128, 4 bpcu per user. 



Fig. 9. Comparison of SNRs required by GSM-MIMO with (nt = 4, 
Urf = 2, 4-QAM), SM-MIMO with (nt = 4, n^f = 1, 16-QAM), SM- 
MIMO with (nt = 2, = 1, 32-QAM), and M-MIMO with (nt = 1, 

n^f = 1, 64-QAM) to achieve a target BER of 10“^ for K = IQ and 
varying N, at the same spectral efficiency of 6 bpcu per user. 


|8] GSM-MIMO performs the best. This is because M-MIMO 
loses performance because of higher-order QAM or increased 
spatial interference from increased number of spatial streams 
per user. 

In Fig. |9] we fix the number of users at AT = 16 and 
the spectral efficiency at 6 bpcu per user, vary the number 
of antennas N at the BS, and compare the SNRs required in 
various systems to achieve a target BER of 10“^. We compare 
the performance of the following four systems: i) GSM-MIMO 
with {nt = 4, Urf = 2, 4-QAM), ii) SM-MIMO with (nt = 4, 
Urf = 1, 16-QAM), Hi) SM-MIMO with (nt = 2, nrf = 1, 
32-QAM), and iv) M-MIMO with (nt = 1, Urf = 1, 64- 
QAM). From Fig. |9] it can be observed that as the number 
of antennas at the BS increases, the required SNR to achieve 
the target BER decreases in all the four systems, which is 
expected because of the increased receive diversity. The sharp 
degradation observed for small values of N is because the 
systems become under-determined when Knt > N, and hence 
the required SNRs shoot up. When Knt < N (fully/over- 
determined), GSM MIMO outperforms M-MIMO by about 9 
dB and SM-MIMO by about 4 to 6 dB. 


B. CHEMP-GSM detection algorithm 

In this subsection, we propose another detection algorithm 
based on message passing. We refer to the detection algo¬ 
rithm presented in this subsection as CHEMP-GSM (channel 
hardening exploiting message passing ll24l . ll25l for GSM) 
algorithm. We also propose a novel channel estimator for use 
in the CHEMP-GSM detector. We refer to the CHEMP-GSM 
detector along with this channel estimator as the ‘CHEMP- 
GSM receiver’. CHEMP-GSM approach is another message 
passing approach which gives less complexity than MMSE 
but performs significantly better than MMSE. 

Matched filtered system model: First, we perform a matched 
filter operation on the received signal vector y in (|4]i as H^y, 


which can be written as 

= H'f^(Hx + n). (23) 

A corresponding equivalent system model can be written as 


where 



z = Jx -f V, 

.. H^H 

N ’ 



(24) 

(25) 


Similar to x in the basic system model in (01, the vector z in 
(l24li can be viewed as a concatenation of K sub-vectors each 
of dimension n* x 1, i.e., z = [zf z^ ''' '^k ''' '^’kV ■ Like¬ 
wise, V = [vf vl’ • • • where Vj = 

is the jth element of v and iJy is the (*, j)th element of 
H. For large N, Vj can be approximated to follow Gaussian 
distribution as vj ~ CAf{0, cr^), where the variance 
Each sub-vector Zfc can be expressed as 


K 

Zfc = JfcfcXfe -I- ^ JkjXj+'Vk, (26) 

A 

= gfc 

where Jkj is a nt x nt sub-matrix of J formed from the 
elements in rows (fc—to knt and columns (j — l)nt-|-l 
to jnt, i.e., J can be written in terms of the sub-matrices as 


Jll 

J12 

JlJf 

J21 

J22 

J2Jf 

Jifl 

^K2 ■ ■ 

^KK 


The vector defined in (|2^ denotes the interference-plus- 
noise to the /cth user’s GSM signal. This term g^ involves the 
off-diagonal elements of J (i.e., Jkj, k j where Jkj is the 
{k,j)th element in J). Due to channel hardening that occurs in 
large MIMO channels, the matrix J (and hence Jfefe,Vfc) has 
strong diagonal elements compared to off-diagonal terms for 
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where 


lE(xj) = 


(29) 


Vs. 


Cov(xj) = ^ ss^pj(s) — E(xj)E(xj)^, (30) 

Vs, ses„^-( 


and 


Pfc(s) = Pr (xfc = s), se 


(31) 


Fig. 10. Message passing in CHEMP-GSM algorithm. 

large N, K. We approximate gk to be multivariate Gaussian 
with mean /x^ and covariance which can be written as 

K 

Mfc = lE(gfc) = X] (27) 

K 

Sfc = Cov(g/c) = ^ JfejCov(xj)jg- +cr^I„t, (28) 



Fig. 11. BER performance of multiuser GSM-MIMO systems with {nt = 4, 
= 2, 4-QAM) and (nt = 4, n^y = 1 using CHEMP-GSM detection, and 
M-MIMO system with (nt = 1, n^y = 1, 64-QAM) using .sphere decoding, 
at 4 bpcu per user, K = 16, N = 64,128. 

improve the rate of convergence. That is, if is the computed 
probability vector at the fth iteration, the message at the end 
of fth iteration is computed as 


Pfc = (l-'5)Pfc 


SpI 


(34) 


Repeat Step 2 for a certain number of iterations, after which 
the algorithm stops. The estimate of the ^th modulation symbol 
transmitted by the fcth user is obtained as 


Sk,i = argmax 

sGA 


E 


Pkis), (35) 


Let Pfc = [pfc(si),pfc(s 2 ),--- ,pfc(S|g^,./^|)] denote the vector 
of probability masses corresponding to GSM signal vector 
Xfc. The probability masses pfc(s)’s are approximated with the 
corresponding a posteriori probabilities (APP), i.e., 

Pfe(s) t-Pr(xfc = s|zfe, J) (32) 

oc exp (—(zfc - Jfcfes - /xj,)^Ej:^(zfc - JfefcS - /xj,)^(33) 

Message passing: The system is modeled as a fully- 
connected graph with K nodes, where the fcth node is an 
approximate APP processor corresponding to Xfc. Node k 
computes the APP based on the incoming messages and the 
knowledge of Zfc and J. The incoming messages to node fc 
are the APPs from the remaining nodes. The APP computed in 
node fc, in turn, is passed to the remaining nodes for their APP 
computation in the next iteration. The messages exchanged 
between the nodes are illustrated in Fig. [TO] The algorithm is 
as follows. 

Step 1: The probability vectors pfc’s are initialized with 
equiprobable masses 

Step 2: Node fc computes pfc as per (l33l l with the incoming 
vector messages {pi,p 2 ,-- - , pfe-i, Pfe+i, • • ■ ,px}. Damping 
of messages with damping factor S G [0,1) is done to 


Vs, : Afi(s) — s 

where A)(s) is the ^th non-zero element in s and I G 
{1,2,-•• ,nrf}. An estimate of the active antenna indices 
chosen for transmission by the fcth user is obtained as 


qk = argmax 


E 


Pkis). (36) 


Vs. s6S^^4 


The values of Sk,i and qk are then demapped to obtain the 
information bits of the fcth user. 

Complexity: The orders of complexity for the computation 
of z and J are 0{NKnt) and OiN respectively. The 
complexities for the computation of (l27l i. (l28l l and (l3^ are of 
orders 0 {n^K‘^), Oin^K^) and (T(n(respectively. 
For N > K and K > nt, the overall complexity of 
the algorithm is dominated by the computation of J whose 
complexity is 0{NK^nt). Therefore, the overall complexity 
of the algorithm is OiNK^nf). This complexity is less than 
the MMSE detection complexity which is 0{N^Knt). This is 
illustrated numerically in Table HJ which shows the complexity 
comparison between MMSE and CHEMP-GSM detection. 

Performance: In Eig. [TT] we present the performance of the 
CHEMP-GSM detector at a spectral efficiency of 6 bpcu per 
user, K = 16, N = 64,128, assuming perfect CSI at the 
receiver. Eigure [H] compares the performance of multiuser 
GSM-MIMO system with (nt = 4, = 2, 4-QAM), with 
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Fig. 12. BER performance comparison of multiuser GSM-MIMO using 
CHEMP-GSM detector and MMSE detector for K = 16, nt = 4, = 2, 

4-QAM at 4 bpcu per user, N = 64,128. 



Eig. 13. SNR required by MP-GSM, CHEMP-GSM, and MMSE detectors 
to achieve a BER of 10“^ with N = 128, nt = 4, n^-f = 2, 4-QAM, and 
6 bpcu per user, as a function of system loading factor KjN. 


that of SM-MIMO system with (n* = 4, n^/ = 1, 16- 
QAM) and M-MIMO system with (nt = 1, Urf = 1, 64- 
QAM). Detection in GSM-MIMO and SM-MIMO systems is 
done using CHEMP-GSM algorithm. Detection in M-MIMO 
system is done using sphere decoding. As observed in Sec. 
IIII-BI and Sec. lIV-Al in Fig. [TT] also we see that GSM-MIMO 
outperforms both SM-MIMO and M-MIMO. This is because 
of the smaller-sized QAM used in GSM-MIMO compared to 
those used in SM-MIMO and M-MIMO to achieve the same 
spectral efficiency. 

CHEMP-GSM Vi MMSE performance: Figure [12] shows the 
performance comparison between CHEMP-GSM detector and 
MMSE detector at a spectral efficiency of 6 bpcu per user, 
K = 16, n* = 4,71,./ = 2, and 4-QAM. It is observed 
that CHEMP-GSM detector outperforms MMSE detector by 
about 9 dB for A^ = 64 and 3 dB for A^ = 128 at a BER 
of 10“^. We note that CHEMP-GSM detector achieves this 
better performance at a lesser complexity compared to MMSE 
detector. This can be observed from Table HU 

Performance as a function of loading factor: In Fig. 1131 we 
compare the performance of different detectors for multiuser 
GSM-MIMO with W = 128, nt = 4, n,/ = 2, 4-QAM, and 6 
bpcu per user as a function of system loading factor K/N. We 
plot the average SNR required by MP-GSM, CHEMP-GSM 
and MMSE detectors with perfect CSI to achieve a BER of 
10“^. From Fig. [T3] we observe that the proposed message 
passing based detectors outperform the MMSE detector by 
about 2 to 3 dB at lower loading factors. This gap widens as 
the system loading factor increases; for example, the gap is 
about 3 dB at K/N = 0.125, and it widens to about 9 dB 
at K/N = 0.25. The SNR required significantly increases 
for high loading factors because the channel gain matrix 
which is of dimension N x Krit becomes under-determined 
for a: > Also, both MP-GSM and CHEMP-GSM algo¬ 
rithms perform almost same, with MP-GSM having a lesser 
complexity than CHEMP-GSM (see Table |II|). However, the 
matched filtered system model in CHEMP-GSM allows a 


simple channel estimation technique, which is presented in 
the following subsection. 


C. Estimation o/H^H 

In obtaining the performance results reported in the previous 
subsection, we have assumed perfect CSI at the receiver. Now, 
we relax this assumption. We present a channel estimation 
scheme suited for use in CHEMP-GSM detector. A conven¬ 
tional approach is to directly obtain an estimated channel 
matrix H through channel estimation techniques (MMSE 
channel estimation, for example) using pilot transmissions, and 
use H in place of H in detection algorithms. For the MMSE 
detector and MP-GSM detector, we follow this approach, 
and we call the receiver employing MMSE detector with 
MMSE channel estimate as ‘MMSE receiver’ and the receiver 
employing MP-GSM detector with MMSE channel estimate as 
‘MP-GSM receiver’. For CHEMP detector, however, instead 
of conventional approaches that estimate H, we directly obtain 
an estimate of the matrix J. The motivation for this approach 
is that H influences the proposed detection operation through 
J = H^H. 

Assume that the channel is slowly fading, and that the fade 
remains constant for one frame duration. Each frame has Lf 
channel uses, consisting of a pilot part and a data part. The 
pilot part consists of Knt channel uses, and the data part 
consists of Lf — Knt channel uses. Let Xp = denote 

the pilot matrix, where, in the jth channel use, 1 < j < 
Knt, the [—]th user terminal transmits a pilot symbol with 
amplitude A through its antenna whose index is given by ((j — 
1) mod nt) -f 1, and the other antennas remain silent. The 
signal received at the BS during pilot part is given by 


Yp = 


HX„ 


Wp = AH 


W 


P’ 


(37) 


/here A = y/KEg, Eg is the average symbol energy, and Wp 
5 the noise matrix. An estimate of the matrix J is obtained 


J = 


Y^Y n-2 

NA^ A2 


•-Kn, 


(38) 
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(a) BER vs SNR. (b) SNR reqd. for 10"^ BER vs K/N. 

Fig. 14. Performance of MMSE Rx, MP-GSM Rx, and CHEMP-GSM Rx 
in multiuser GSM-MIMO with N = 128, nt = 4, = 2, 4-/8-/16-QAM, 

and estimated CSI. 


where 


N ■ 


An estimate of the vector z is obtained as 


z = 


Yp^y 
NA ’ 


(39) 


where y is the received signal vector in the data phase. These 
estimates J and z in (l38l l and (l3^ are used in the CHEMP- 
GSM detection algorithm in place of J and z. 

Performance and complexity: In Figs. |14(a)| and |14(b)| we 
present performance comparisons between i) MMSE receiver 
(MMSE detector with MMSE channel estimate), ii) MP-GSM 
receiver (MP-GSM detector with MMSE channel estimate), 
and Hi) CHEMP-GSM receiver (CHEMP-GSM detector with 
the proposed estimate of J). Multiuser GSM-MIMO with N = 
128, rit = 4, Tirf = 2, and 4-/8-/16-QAM is considered. Figure 
|14(a)| shows the BER vs SNR plots for K = 16, i.e., loading 
factor K/N = 0.125. Figure |14(b)| shows SNR required to 
achieve 10“^ BER as a function of loading factor K/N. Table 
nni presents corresponding complexity comparison between 
the receivers considered. It can be observed that, because of 
the additional complexity of MMSE channel estimation, the 
complexities of MMSE receiver and MP-GSM receiver are 
more than the corresponding complexities of MMSE detector 
and MP-GSM detector in Table |II] However, the complexities 
of CHEMP-GSM receiver and CHEMP-GSM detector are the 
same. This is because the complexities of computing J and 
z are the same as those of computing J and z, respectively. 
From Figs. |14(^ and [T4(b)l it can be seen that the performance 
of the CHEMP-GSM receiver is the best among the three 
receivers considered. Also, the complexities of both MP-GSM 
and CHEMP-GSM receivers are much less compared to that of 
MMSE receiver, and the complexity of CHEMP-GSM receiver 
is a little higher than the complexity of MP-GSM receiver due 
to the additional computation of z in (l3^ . 


V. Multiuser GSM-MIMO in frequency selective 

FADING 

In this section, we assume the multiuser GSM-MIMO 
system model described in Section |II] except for the channel 


K 

Complexity in number of real operations xlO^, N = 128 

MMSE Rx. 

MP-GSM Rx. 

CHEMP-GSM Rx. 

16 

4.041 

2.466 

3.142 

32 

21.294 

4.925 

6.281 

64 

30.037 

10.831 

12.265 

96 

39.513 

16.746 

18.013 

128 

48.622 

20.961 

21.637 


TABLE III 

Comparison between the complexities (in number of real 
OPERATIONS) OF THE MP-GSM RECEIVER, CHEMP-GSM RECEIVER, 
MMSE RECEIVER IN GSM-MIMO WITH N = 128, nt = 4, = 2, 

AND 4-QAM. 

model which is now assumed to be frequency selective. Let 
L denote the number of multipath components between each 
pair of transmit antenna at the user and receive antenna at the 
BS. Let denote the channel gain from the jth 

transmit antenna of the fcth user to the ith BS receive antenna 
on the kh multipath component. The channel gains for the /th 
multipath component are modeled as complex Gaussian with 
zero mean and variance fl;. The power-delay profile of the 
channel is modeled as 

= / = (40) 

where ^ denotes the decay-rate of the average power in each 
of the multipath components in dB. The total power of the 
channel is assumed to be unity, i.e., = 1. We will 

further assume that the channel coefficients are estimated at 
the BS using a pilot transmission based channel estimation. 

CPSC transmission: We employ cyclic prefixed single car¬ 
rier (CPSC) transmission, which has the advantage of low 
peak-to-average power ratio (PAPR) IlMl . ll^ . Transmission 
is carried out in frames, where each frame consists of multiple 
blocks as shown in Fig. [15] The fade coefficients are assumed 
to be constant over one frame duration. Each frame consists of 
one pilot block (PB) meant for channel estimation, followed by 
I data blocks (DB). The pilot block consists of {L—l)+KntL 
channel uses. In the first L—1 channel uses in the pilot block, 
zeros are transmitted to avoid interference from previously 
transmitted frames. In each of the remaining KritL channel 
uses, a iTnt-length pilot symbol vector comprising of pilot 
symbols transmitted from K users (nt pilot symbols per user) 
is received by the N BS receive antennas. Each data block 
consists of Q + L — 1 channel uses, where Q information 
symbol vectors, each of length Knt, prefixed by [L — 1)- 
length cyclic prefix from each user are transmitted. With / 
data blocks in a frame, the number of channel uses in the data 
part of the frame is [Q L — 1)1. Taking both pilot and data 
channel uses into account, the total number of channel uses 
per frame is [Knt + 1)L + (Q -\- L — 1)1 — 1. 

Channel estimation: Let P be the power of the pilot 
symbol transmitted by each user. Then, in KntL channel 
uses, the pilot sequence p^' transmitted by the jth transmit 
antenna of the /cth user is given by the KutL-length vector 

Pj = [0((fc-l)nt+i-l)Lxl ’/P 0{i{K-k+l)nt-j+l)L-l)xl]- 

Let 

u . _ r rrCO) 1 

l)nt -\-j i,{k — l)nt+j i,{k — l)nt+j^' 
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Frame 1 I Frame 2 





PB 

DB-1 

DB-2 

-; 

DB-d 


DB-/ 


Userl 

L — 1 zeros A'/itL pilot symbols 

Userl 

L-l length CP| 

Q information symbols I 





User if 

L-l zeros if/i;;! pilot symbols 

User/r 

i-l length CP 

Q information symbols 


IDB 


Fig. 15. Frame structure of CPSC scheme for multiuser GSM-MIMO in 
frequency selective fading. 


The iTrit L-length vector received at the ith BS antenna in the 
pilot phase is then given by 


= [yl’^ yl’^ 

= Hi 


i,KntL- 

y? 

^\,{k-l)nt+o 


,,Knt 


’Vp- 


n„ 


(41) 


= h“ 


where yl'^ is the received signal at the ith BS antenna in the 
tth channel use of the pilot phase, and nj is the KntLA&ngih 
noise sequence at the ith BS antenna. An MMSE estimate of 
the channel gain vector h* can be obtained from (HTI) as 


h* 


P + Cr2 


(42) 


Signal detection: Each data block (DB) in a frame consists 
of a cyclic prefix (CP) followed by data symbols as shown 
in Eig. [15] Let denote the transmit vector from 

the fcth user in the fth channel use in a DB. The /cth user’s 
transmit vector in a DB is of the form 


^(Q-L+l)'r 






JQ-i)Pt 


CP 


Assuming perfect synchronization and discarding the CP at 
the BS receiver, the received signal vector can be written as 


y' = H'x' + n', (43) 

where y' is y^^^^ • • • y(*) e 

(jenotes the received vector at the fth channel use in 
a DB, x' is g ^KntQxi^ ^(t) g 

X1 js the vector comprising of transmit vectors of all the 
users in the fth channel use in a DB, H' is the channel gain 
matrix of dimension NQx KritQ, and n' is the additive white 
Gaussian noise vector given by g 

CNQ^i. The received signal vector in the tth channel use in 
a DB can be written as 

L-l 

yW = ^ f = 0,l,--- ,Q-1, (44) 

1=0 

where g i^NxKnt channel gain matrix correspond¬ 
ing to the hh multipath component such that 
represents the channel gain from the jth transmit antenna of 


the fcth user to the ith BS receive antenna in the hh multipath. 
Eor this system, the ML detection rule is given by 

x' = argmin||y'-HVf, (45) 

x'gG« 

where G = , and the exact computation of (1451) 

requires exponential complexity in KQ. We shall formulate 
the system model in (l43l l into an equivalent system model in 
the frequency domain, and employ the algorithms for signal 
detection (presented in Section llVb on the resulting equivalent 
system model. 

It is noted that because of the addition of CP, the matrix H' 
is a block circulant matrix. Therefore, H' can be transformed 
into a block diagonal matrix D as 

D = (F®Iiv)H'(F^®lK„J, (46) 


where I„ denotes n x n identity matrix, and F is the Q x Q 
DET matrix, given by 


F = 


1 


VQ 

.PQ-1,0 PQ-1,1 

where = exp(-j 


Po.o 

Pi.o 


Po,i 

Pi.i 


Po,Q-i 

Pi.Q-i 


the form 


D = 


Q ’■ 

Do 

0 


PQ-1,Q-1_ 

D is a block diagonal matrix of 

0 

(47) 


• Dq_i_ 

where Dg is of dimension N x Krit- The {i, (fc — l)nt + j)th 
element of Dg is the gth element of the DET of the vector 


,, , 77^)1 n , • • • 77^^". 0 • • • 0]G 

Performing DPT operation on the received vector y' at the 
receiver, we get 

z' = (F(g)Ijv)y'= (F(g)I^)HV + w', (48) 

where w' = (F ® lAr)n'. Further, z' can be written as 


( 1 ) 


(L-l) 


= D(F 0 Iktl 
= Hx' + w', 


)x' + - 


(49) 


where H = D(F 0 liCnt) is the equivalent channel. Now, 
detection can be performed on the system model in ( l49b 
using the MP-GSM and CHEMP-GSM algorithms presented 
in Section HVl 

Performance with perfect CSI: We evaluated the perfor¬ 
mance of multiuser GSM-MIMO CPSC systems in frequency 
selective channel with L = 3,Q = 6,^ = 3 dB, N = 128, 
and perfect CSI. MMSE, MP-GSM and CHEMP-GSM al¬ 
gorithms are used for GSM-MIMO signal detection. Figure 
[Tbl shows the performance comparison between GSM-MIMO 
with (rit = 4, Urf = 2, 4-QAM) and M-MIMO with {rit = 1, 
rirf = 1, 64-QAM), both having 6 bpcu per user and K = 16. 
LAS algorithm in is used for M-MIMO signal detection. 
From Fig. (Tb] we observe that GSM-MIMO outperforms M- 
MIMO by about 12 dB at a BER of lO’^. Also, MP-GSM 
and CHEMP-GSM detectors outperform MMSE detector by 
about 3 dB at a BER of IQ-^. 












































14 



Fig. 16. BER performance of multiuser GSM-MIMO with (nt = 4, n^f = 
2, 4-QAM) and M-MIMO with {nt = 1, Urf = 1, 64-QAM) for K = 16, 
N = 128, 6 bpcu per user, frequency selective fading with L = 3 and ^ = 3 
dB, CPSC transmission with (5 = 6, and perfect CSI. 



Fig. 17. BER performance of multiuser GSM-MIMO with {nt = 4, n^f = 
2, 4-QAM) and M-MIMO with {m = 1, n^f = 1, 64-QAM) for K = 16, 
N = 128, 6 bpcu per user, frequency selective fading with L = 3,^ = 3 
dB), CPSC transmission with ((5 = 6, L = 3, ^ = 3 dB), estimated CSI. 

Performance with estimated CSI: Figure [T3 shows the 
performance of GSM-MIMO CPSC systems with estimated 
CSI for the same system and channel parameters in Fig. 
[Thl MMSE receiver (MMSE detector with MMSE channel 
estimator), MP-GSM receiver (MP-GSM detector with MMSE 
channel estimate, and CHEMP-GSM receiver (CHEMP-GSM 
detector with the proposed estimate of J) are used for GSM- 
MIMO. LAS receiver (LAS detection with MMSE channel 
estimator) is used for M-MIMO. We observe that GSM-MIMO 
CPSC system performs better than M-MIMO CPSC system by 
about 11 dB at a BER of lO'^. 

VI. Conclusions 

We investigated generalized spatial modulation (GSM) for 
multiuser communication on the uplink in large-scale MIMO 
systems. We derived an analytical upper bound on the average 
bit error probability in multiuser GSM-MIMO systems with 


ML detection. The bound was shown to be tight at moderate- 
to-high SNRs. Numerical results showed that, for the same 
spectral efficiency, multiuser GSM-MIMO can outperform 
conventional multiuser MIMO by several dBs. We also pro¬ 
posed low-complexity algorithms for multiuser GSM-MIMO 
signal detection and channel estimation at the BS receiver 
based on message passing. The performance of these proposed 
algorithms in large-scale GSM-MIMO systems with tens of 
users and hundreds of BS antennas showed that multiuser 
GSM-MIMO can outperform conventional multiuser MIMO. 
The SNR advantage of GSM-MIMO over conventional MIMO 
is attributed to the following reasons: (i) because of the spatial 
index bits, to achieve the same spectral efficiency, GSM- 
MIMO can use a lower-order QAM alphabet compared to 
that in conventional MIMO, and (ii) to achieve same spectral 
efficiency and QAM size, conventional MIMO will need more 
spatial streams per user which results in increased spatial 
interference. This performance advantage along with low 
RE hardware complexity makes large-scale multiuser GSM- 
MIMO very attractive. We further note that the SM concept 
has recently been validated with the aid of experimental 
activities in indoors and outdoors EH,ESI. These practical 
advancements in SM and the performance advantage in GSM- 
MIMO suggest that large-scale multiuser GSM-MIMO is an 
attractive technology for future wireless systems like 5G. 
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